Wow! What a Useful Extension! Introducing Non-Referential Concepts to Wordnet
Abstract
In this paper we present ongoing efforts to expand the depth and breadth of the Open Multilingual Wordnet's coverage by introducing two new classes of non-referential concepts to wordnet hierarchies: interjections and numeral classifiers. The lexical semantic hierarchy pioneered by Princeton Wordnet has traditionally restricted its coverage to referential, contentful classes of words, such as nouns, verbs, adjectives and adverbs. Previous efforts have enriched wordnet resources by, for example, including pronouns, determiners and quantifiers within their hierarchies. Following these efforts, and motivated by the ongoing semantic annotation of the NTU-Multilingual Corpus, we decided that the four traditional classes of words present in wordnets were too restrictive. Though non-referential, interjections and classifiers possess interesting semantic features that can be captured well by lexical resources like wordnets. In this paper, we further motivate our decision to include non-referential concepts in wordnets and give an account of the current state of this expansion.
Similar resources
Deriving Metonymic Coercions from WordNet
This paper presents a method for deriving metonymic coercions from the knowledge available in WordNet. Two different classes of metonymies are inferred by using (1) lexico-semantic connections between concepts or (2) morphological cues and logical formulae defining lexical concepts. In both cases the derivation of metonymic paths is based on approximations of sortal constraints retrieved from W...
An Analysis of Achievement of the Philosophical Sense of “Extension” in Music, with Interpretation of Ibn-e Sina’s Explanation an Extension
This research can be considered as one of the studies that seek to explore, in an argumentative way, subtle and solid philosophical concepts in the field of art. The paper provides an analysis of the concept of “extension” in music as one of the most thought-provoking philosophical concepts. The analysis is carried out by interpreting Ibn-Sina’s special conception of musical extension to answer...
Desiderata For Tagging With WordNet Synsets Or MCCA Categories
Minnesota Contextual Content Analysis (MCCA) is a technique for characterizing the concepts and themes occurring in text (sentences, paragraphs, interview transcripts, books). MCCA tags each word with a category and examines the distribution of categories against norms representing general usage of categories. MCCA also scores texts in terms of social contexts that are similar to different func...
Combining Qualitative and Quantitative Keyword Extraction Methods with Document Layout Analysis
The large availability of documents in digital format posed the problem of efficient and effective retrieval mechanisms. This involves the ability to process natural language, which is a significantly complex task. Traditional algorithms based on term matching between the document and the query, although efficient, are not able to catch the intended meaning of both, and hence cannot ensure effe...
Is it possible to create a very large wordnet in 100 days? An evaluation
Wordnets are large-scale lexical databases of related words and concepts, useful for language-aware software applications. They have recently been built for many languages by using various approaches. The Finnish wordnet, FinnWordNet (FiWN), was created by translating the more than 200,000 word senses in the English Princeton WordNet (PWN) 3.0 in 100 days. To ensure quality, they were translate...